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Abstract 

The paper presents a novel algorithm for object 
classification in videos based on improved support 
vector machine (SVM) and genetic algorithm. One of 
the problems of support vector machine is selection of 
the appropriate parameters for the kernel. This has 
affected the accuracy of the SVM over the years. This 
research aims at optimizing the SVM Radial Basis 
kernel parameters using the genetic algorithm. 
Moving object classification is a requirement in smart 
visual surveillance systems as it allows the system to 
know the kind of object in the scene and be able to 
recognize the actions the object can perform. 

1 Introduction 

Object classification in videos is the process of 
recognizing the classes of objects detected in videos. 
It is an important requirement in surveillance systems 
as it aids understanding of the intentions or actions 
that the object can perform. For instance human 
beings can sit, walk, run or fall while vehicles can 
move, run, over-speed or crash. Object classification 
is a challenging task because of various object poses, 
illumination and occlusion [2], 

Recently, many research works have been carried out 
in literature on object classification in videos. 
Heikkila and Silven, [7], presents a real-time system 
for monitoring of cyclists and pedestrians. The 
classification algorithm adopted is learning vector 
quantization however, the classification accuracy 
obtained is low. The classification rate is low. The 
authors in [5] classified moving object blobs into 
general classes such as ‘humans’ and ‘vehicles’ using 
neural networks. Each neural network is a standard 
three-layer network. Learning in the network is 
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accomplished using the back propagation algorithm. 
Input features to the network are a mixture of image- 
based and scene based object parameters namely 
image blob dispersednes (perimeter 2 /area (pixels)); 
image blob area (pixels); apparent aspect ratio of the 
blob bounding box; and camera zoom. There are three 
output classes, namely human, vehicle and human 
group. This approach fails to discriminate object with 
similar dispersednes. The authors in [9] used Artificial 
Neural Network approach for the classification of 
human motion on a still camera. It is noted that task to 
classify and identify objects in the video is difficult 
for human operator. Object is detected using 
background subtraction technique. The detected 
moving object is divided into 8x8 non-overlapping 
blocks. The mean of each of the blocks is calculated. 
All mean value is then accumulated to form a feature 
vector. A neural network is trained using the 
generated feature vectors. Experiment performed 
shows a good classification rate but the object 
detection algorithm used cannot work under a quasi- 
stationary background. Not only this, the 
computational time of the features is time consuming 
which is a problem to surveillance systems. In [2], a 
neuro-genetic model for moving object classification 
in videos is presented. A genetic model is used to 
obtain optimum weights of a neural network. The 
optimum weights are used later by the multilayer 
feed- forward neural network model to classify objects 
as human or vehicle. The model is compared with a 
neural network trained using back-propagation 
algorithm on a set of objects detected from real life 
videos. The neuro-genetic model outperforms with 
classification rate of 99.09% while the back- 
propagation neural network achieves the classification 
rate of 98.5% . 

In [1 1] SVM is used to recognize facial expressions in 
videos. Automatic facial feature trackers are used to 
locate faces in videos. Features are then extracted 
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which are then supplied to SVM to recognize facial 
expression. The classification accuracy shows that 
SVM is capable of recognizing facial expressions. In 
[12], SVM is used to recognize detected objects in 
video for tracking purposes. A simple background 
subtraction technique is used to extract the object. 
Moment features of the detected objects are calculated 
and fed into SVM for classification. In [11] and [12] 
more classification accuracy could have been 
achieved by optimizing the parameters of the SVM. 
The hybrid of GA and SVM have been used in [13] 
for classification of satellite images. There is the need 
to improve the object classification in video 
surveillance applications. 

Recently SVM have been reported as an efficient 
classifier. SVM is based on the statistical learning 
method based on structural risk minimization instead 
of the empirical risk minimization to improve the 
generalization ability of a model. It is however 
realized that the selection of appropriate kernel and 
kernel parameters can have great influence on the 
performance of the model [13]. The approach that 
have been used for optimizing the hyper-parameters 
of the SVM is the grid search which is always time 
consuming and does not perform well [13]. In this 
paper, a genetically optimized support vector machine 
is presented for human object classification in videos. 

2 Support Vector Machines (SVM) 

Support Vector Machines are a set of supervised 
learning methods used in classification and 
regression. This machine learning technique was 
proposed by Vapnik in 1995. The classification 
problem can be restricted to consideration of the two- 
class problem without loss of generality. In this 
problem, the goal is to separate the two classes by a 
function which is induced from available examples 
[16]- 

The motivation for SVM is to create a classifier that 
will work well on unseen examples, that is, 
generalizes well. The objective is to create a classifier 
that uses the structural minimization principle as 
against those that use empirical error minimization 
principle. Consider the example in Figure 1. There 
are many possible linear classifiers that can separate 
the data, but there is only one that maximizes the 


margin (maximizes the distance between it and the 
nearest data point of each class). This linear classifier 
is termed the optimal separating hyper-plane. 
Intuitively, this boundary is expected to generalize 
well as opposed to the other possible boundaries. 

Let m-dimensional training input vectors, x; 
(i=l,...,m) belong to class 1 or 2 and the associated 
labels be y; =1 for class 1 and -1 for class 2. If these 
data are linearly separable, a decision function 
satisfying Equation 1 can be constructed. 

D(x) = w T x i +b 

where w is an m-dimensional vector and b is a bias 
term. 

If the training data is linearly separable, then no 
training data satisfy: 

w T x t + b = 0 (2) 

This hyper-plane should have the best generalization 
capability. As shown in Figure 1, the +1 and the -1 are 
the training dataset which belong to two classes. The 
plane H series are the hyperplanes that separate the 
two classes. The optimal plane H is found by 
maximizing the margin value 2/||w||. Hyperplanes 

H l and H 2 are the planes on the border of each class 
and also parallel to the optimal hyper-plane H. The 
data located on H l and H 2 are called support vectors 

[ 83 - 



Figure 1 The SVM Binary Classification 
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Given a set of training samples Xj,x 2 ,...,x m 

with labels y t e {— 1,4-1} a hyper-plane 

? 

w.x + b separates the data into classes such 
that: 

w.x ; . +b > 1 if y t =1 
w.X; + b <; -1 if y t = -1 ( 4 ) 

Vi. 


These constraints can be expressed in compact form 
as: 

y,.(w.X;+6) > 1 

( 5 ) 

which can be written as: 

( 6 ) 

y,(w.X; 4-z>) — l > o 


It has been shown that if no hyper-plane exists 
(because the data is not linearly separable), a penalty 
terms ^ is added to account for misclassifications. 

This can be translated to the following minimization 
problem: 

minimize: |w| 2 / 2 + C^. ^ 

(?) 


where C, the capacity, is a parameter which allows us 
to specify how strictly the classifier can fit the 
training data. This can be translated into the following 
dual problem: 


maximi z e 


ize: 


a i a ] y l y j x r x j 


hj 


( 8 ) 

subject to: 0<a t <C 


= ° 


which can be solved by standard techniques of 
quadratic programming. The vector w that defines the 
optimum separating hyperplane is obtained by: 


(9) 

The threshold b of the optimal separating 
hyper-plane is obtained by: 


Y j a i y i x r x j +b = y t 


7=1 


( 10 ) 


b = y i -YjOc i y i x i -x j ( 11 ) 

7=1 

The prediction of new patterns is given by 

( m A 

Class(X;) = sign '^ j cc i y i x i ■ x k +b (12) 

V i=l J 


The training samples for which the Lagrange 
multipliers are non-zero are called support vectors. 
Samples for which the corresponding Lagrange 
multiplier is zero can be removed from the training set 
without affecting the position of the final hyperplane. 
The classification framework outlined above is 
limited to linear separating hyperplanes. It is possible 
however to use a non-linear hyper-plane by first 
mapping the sample points into a higher dimensional 
space using a non-linear mapping. That is, by 

choosing a map (j ) : 31" —»(/ where the dimension of 
(// is greater than n. A separating hyper-plane is then 
found in the higher dimensional space. This is 
equivalent to a non-linear separating surface in jR" . 
The data only ever appears in our training problem in 
the form of dot products, so in the higher dimensional 
space the data appears in the form ^(X;).^(x y ) . If the 

dimensionality of y/ is very large, this product could 
be difficult or expensive to compute. However, by 
introducing a kernel function such that: 

K{x i ,x ] ) = (t){x i ).(t){x j ) (13) 


Equation 13 can be used in place of x i .x j everywhere 


in the optimization problem and never need to know 
explicitly what (j) is. The development of a SVM 
classification model depends on the selection of 
kernel function K. There are several kernels that can 
be used in SVM models. These include linear, 
polynomial, Radial Basis Function (RBF) and 
sigmoid function. The Radial Basis Kernel is given 
by: 


*&,*,)= ex p\-^\\X-Xj[ 


(14) 


The RBF is by far the most popular choice of kernel 
types used in SVM. This is mainly because of their 
localized and finite responses across the entire range 
of the real x-axis. After solving for w and b, the class 
a test vector x ( belongs to is determined by 

evaluating w.x f +b or w.^(X;) + b if a transform to a 
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higher dimensional space has been used. It can be 
shown that the solution for w is given by: 

( 15 ) 

i 

Therefore, w .</>(\ t ) +b can be rewritten thus: 

Z ) • ^(x t ) + b = Z a i y i K(x i ,x t ) + b ( 16 ) 

Thus, the Kernel function can be used rather than 
actually making the transformation to higher 
dimensional space since the data appears only in dot 
product form. The a and C are the free hyper- 
parameters the users can supply. These two 
parameters has a great role to play in the performance 
of the SVM. Several choices have used to select 
optimal values of these parameters. These include 
cross-validation, Particle swarm optimization and 
genetic optimization. In this paper, however, the 
genetic program approach is employed. 


3 Genetic Algorithm 

3 . 1 Genetic Algorithms 

Genetic Algorithm (GA) was developed by John 
Holland in 1975. The algorithm is based on the 
mechanics of the natural selection process (biological 
evolution). The concept of the GA is that the strong 
tends to adapt and survive while the weak tends to die 
out [17] The technique is about generating a random 
initial population of individuals, each of which 
represents a potential solution to a problem [14], The 
process begins by coding the genes in a form that 
represents the solution to the particular problem. In 
Holland’s Genetic Algorithm, each feasible solution is 
encoded as a chromosome (string of zeros and ones) 
also called a genotype. Then initial population size is 
specified and a number of chromosomes of the 
population size are then created. Optimization is 
based on evolution, and the "Survival of the fittest" 
concept. Each chromosome is given a measure of 
fitness via a fitness (evaluation or objective) function. 
The fitness of a chromosome determines its ability to 
survive and produce offspring [17]. As reproduction 
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takes place, the crossover operator exchanges parts of 
two single chromosomes to produce children and the 
mutation operator changes the gene value in some 
randomly chosen location of the chromosome [10]. 
The process of evaluation, selection, and 
recombination is iterated until the population 
converges to an acceptable solution.Genetic 
Algorithms (GA) has been applied in various areas of 
computer vision such as weights optimization of 
artificial neural networks [2;10;14;15], video 
segmentation [4], 

3 SVM parameter selection based on Genetic 
Algorithm 

Using the Genetic algorithm, the two parameters C 
and a of SVM model can be optimized. The process 
of optimizing these parameters is shown in Figure 2. 



Figure 2 Flowchart of GA for SVM parameters 
optimization. 

a. Initialization. To start with, the initial population 
is made up of chromosomes chosen at random or 
based on heuristically selected strings. Population 
size affects the efficiency of the performance of a 
GA. The first step in GA implementation is the 
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determination of a genetic encoding scheme, that 
is, to denote each possible point in the problem’s 
search space as a characteristic string of defined 
length. The genes represent the value of c and C 
and are concatenated to form a chromosome. 
Several of these chromosomes are randomly 
generated from the network architecture to form 
the initial population of the solution space. Figure 
3 shows a typical chromosome. 


01010 


10011 


work, the roulette wheel method is adopted for 
selection. The probability of selection is given by: 


Pi = 


i _ ft 
2?Lo fi fsum 


(19) 


in which; f is the fitness value of individual i,f sum is 
the total fitness value of population; P t is the 
selective probability of individual. It is obvious that 
individuals with high flexibility values are more 
likely to be reproduced during the next generation. 


Figure 3: Sample Chromosome Encoding of C and a 
of the RBF using strings. 

In this research work, C and a values of the RBF are 
simultaneously coded as one gene. Each of the gene 
is coded using randomized binary numbers of length 
5. The minimum value is 0.1 while the maximum 


d. Perform Crossover. In this research work, 
one-point crossing is adopted. The specific operation 
is to randomly set one crossing point among 
individual strings. When crossing is executed, 
partial configuration of the anterior point and 
posterior point are exchanged, and this gave birth to 
two new offspring 


value is 100. The number of bits to represent the 
value of a gene must satisfy: 


v > log 2 (■ 


maxo/ X£-mino x- { 


Ax 


i+l);i=l,2.3„ 


(17) 


where v is the string length, min of x t is the 
minimum value of the gene, max of x t is the 
maximum value of the gene and Ax is the error 
tolerance. 


A chromosome represents one RBF. The length of 
chromosome is obtained by concatenating the bits 
representing each gene as shown in Figure 3. 


e. Mutation. As for two-value code strings, 
mutation operation is to reverse the gene 
values within a random number generated 
between zero and one. 

f. Termination Criterion: The termination 

condition is the maximum number of 
generations. 

Genetic control parameters dictate how the algorithm 
will behave. Changing these parameters can change 
the computational result. These parameters are 
population size, crossing probability, mutation 
probability and network termination condition. In this 
work population size N is 50, crossing probability P c 
is 0.8, mutation probability P m is 0.5, and network’s 
terminative condition is MAXGEN of 100. 


b. Fitness Function by Cross-Validation. 

When GA is applied to solve a problem, the 
definition of the evaluation function to evaluate the 
problem-solving ability of a chromosome is 
important. Since SVM works by classification, the 
classification rate is used as the fitness function. The 
Accuracy is computed as: 

Accuracy=correctly / total *100 (18) 

c. Perform Selection. The fitness of the new 
offspring is calculated and sorted in the descending 
order. So, chromosomes of highest fitness values are 
selected for the next generation. In this research 


4 Object Classification using the proposed 

GA-SVM model 

4.1 Data Acquisition 

The data used for this research work are obtained 
from videos of moving objects taken in Nigeria roads. 
The moving objects are detected using background 
subtraction algorithm and the features are extracted 
from the silhouettes of the detected objects. 

4.1.1 Object Segmentation 
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Kernel Density Estimation (KDE) is the mostly used 
and studied nonparametric density estimation 
algorithm. The model is the reference dataset, 
containing the reference points indexed natural 
numbered and has been used in [6] for foreground 
detection. The algorithm assumed that a local kernel 
function is centered upon each reference point and its 
scale parameter (the bandwidth). The common 
choices for kernels include the Gaussian and the 
Epanechnikov kernel. The algorithm is presented as 
follows. Let x 1 x 2 , ... , x Ui e R d be a random sample 
taken from a continuous, univariate density f KDE is 
given by: 

/te«=£zr-,*0?9 <20) 

k(.) is the function satisfying : 

J k{x)dx = 1 (21) 

k(.) is refered to as the Kernel, h is a positive number, 
usually called the bandwidth or window width. The 
Gaussian Kernel is given by: 

K n = (2<rlexp(-fr) (22) 

where r=||x|| 2 

The Epanechnikov kernel is given by: 

-icj 1 (d+ 2 )(i-r) if r < 1 
0 otherwise 

(23) 

where d is the dimension of feature space, c d is the 
volume of the d-dimensional sphere. 

KDE for background modeling involves using a 
number of frames (training frames) to build the 
probability density of each pixel location. The 
adaptive threshold of each pixel is found after the 
construction of the histogram. 

For every pixel observation, classification involves 
determining if the pixel belongs to the background or 
the foreground (as shown in Figure 4). The first few 
initial frames in the video sequence (called learning 
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frames) are used to build histogram of distributions of 
the pixel color. No classification is done for these 
learning frames, but for the subsequent frames 
depending on whether the obtained value exceeds the 
threshold or not. If the threshold is exceeded, 
background classification is done, otherwise 
foreground classification. Typically, in a video 
sequence involving moving objects, at a particular 
spatial pixel position a majority of the pixel 
observations would correspond to the background. 
Therefore, background clusters would typically 
account for much more observations than the 
foreground clusters. This means that the probability of 
any background pixel would be higher than that of a 
foreground pixel. The pixels are ordered based on 
their corresponding value of the histogram bin which 
relies on the adaptive threshold in the previous stage. 
The pixel intensity values for the subsequent frames 
are estimated and the corresponding histogram bin is 
evaluated with the bin value corresponding to the 
intensity determined. 


4.1.2 Feature Extraction 

The blobs extracted from videos used to extract the 
feature vectors which were are then normalized by 
dividing the vector by the sum of the lengths. These 
vectors are then used to train the support vector 
machine. 

Instead of segmenting the silhouettes into 8x8 non- 
overlapping blocks as shown in [9], radial features are 
directly calculated from the detected objects as shown 
in Figure 4. This captures the shape and a series of the 
features encodes the motion information. 

The centroid of the contour (c x , c y ) is calculated using 
Equation 24. From the centroid, a pre-defined number 
of axes are projected outwards at specified regular 
angles to the nearest edges of the contour in an anti- 
clockwise direction as shown in Figures 4. 

(c x ,c y )= ^(ZJUxt.ZE-iyt) 

(24) 
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Figure 4: The radial Distance Shape Features 
Extraction 

The distance from the centroid to its nearest edge 
along a predefined angle is then stored. This is done 
for a set of angular vectors. The dimension of each 
vector equals the number of axes being projected from 
the centroid. The vector is then normalized to ensure 
that the vector is scale invariant and the largest value 
in the vector will at this point be 1.0, which will be 
the longest of the axes projected. Figures 5a and 5b 
show the execrated features from human and vehicles 
respectively. 



Figure 5b: Typiocal Radial 1- D distance feature 
vector from Humans (left) and vehicles (right) 

Let S be the segmented object region within the 
frame, f be a line projected from the centroid to the 
object boundary at angle i to the horizontal line 
passing through the centoroid of the object, then the 
length of each line to the contour boundary of the 
object is given by: 



Figure 5a: Typiocal Radial 1- D distance feature 
vector from Humans (left) and vehicles (right) 


il=I*-c«(j>(*.0) (25) 

k and / are the co-ordinates in the x and y directions 
respectively c is the centre point and w is the contour 
boundary. / is given by ktan(0), d(. ) is a binary 
function that returns 0 or 1 . 


S(p(k,Q) 


1 if p(k, l) e S 
0 otherwise 


(26) 


where p(k, Z) is the pixel value of the object. 

The number of lines in each image containing an 
object as well as the number of neurons in the input 
layer is j where j = 1 , 2, 3, . . . ., n and n is given by n = 
360 /Omin where dmin is the smallest of the angles. 
Angular size of 10 degrees interval is used to obtain 
36 regions beginning with 10°. Thirty two (32) of 
these regions were selected as feature vectors. See [1] 
for more details of the segmentation algorithm. 
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4.2 GA-SVM Object Classification 

In this work, the training of the SVM consists of 
providing the SVM with data for vehicles and 
pedestrians. Data for each class consist of a set of n 
dimensional vectors. A Radial Basis Function 
(RBF) kernel is applied to the SVM which then 
attempts to construct a hyper-plane in the n- 
dimensional space, attempting to maximize the 
margin between the two input classes. The SVM type 
used in this work is C-SVM using a non-linear 
classifier where C is the cost hyper-parameter [2], The 
SVM is trained using ID radial signals extracted from 
the silhouettes of the labeled images of humans and 
vehicles. Given a training set of human and vehicle 
images consisting of multiple labeled images of each 
human and vehicle classes an SVM classifier is 
trained as follow: ID features data are extracted from 
the training set images to create the vector X = (xi, 
X 2 ,...,xl) for human and Y = (yi,y2, ..., yx) for vehicle 
images, where L and K are the total number of 
training images for class human and vehicle 
respectively. To train the SVM, X = (xi, X 2 , ..., xl) are 
used as the positive labeled training data and Y = 
(yi,y 2 , are used as the negative labeled training 

data. The SVM is then trained to maximize the hyper- 
plane margin between their respective classes (®i, 
0 2 ). To classify 

an object F, it must be assigned to one of the p 
possible classes (®1, ®2), in this case p takes two 
values. For each object class, ®p, an SVM is used to 
calculate this class that the object F belongs to given 
measurement vector D where in this case D is the set 
of ID radial features. 


5 Implementation of the Proposed Model 

In order to evaluate the classification performance of 
the proposed SVM model, some video data were 
gotten from major roads in Akure, Nigeria. The 
background subtraction algorithm was applied to 
detect objects from the videos. Features of these 
objects are exccted and stored in a database. The data 
are divided randomly into the 80 training dataset 333 
test dataset. The 80 training dataset consist of 40 
vehicles and 40 pedestrians. The test dataset consists 
of 100 vehicles and 223 pedestrians. The class vehicle 
is assigned 1 while the class human is assigned 0. The 
GA-SVM model is trained using [3]. In order to 


compare the classification performance of the 
proposed model with other classifiers such as the 
normal SVM, K-Means, K-NN, ANN, GA-ANN, the 
training is done with the same set of data and the 
same testing dataset is used to evaluate each classifier. 
In the proposed GA-SVM model, the SVM adopts 
Radial Basis Function as the kernel. The parameters C 
and y of the SVM are optimized using Genetic 
Algorithm. Then these optimal parameters are used to 
train the SVM model. In the normal SVM model, the 
SVM use RBF as the kernel function. The parameters 
C and y are randomly selected. In the K-Nearest 
Neighbour training, data for vehicles and pedestrians 
are provided for the K-NN. As many instances of 
vehicles and pedestrians are provided with their 
corresponding class labels. After the training is done, 
K-Nearest can then be used to classify test instances. 
To classify a new instance, the instance is supplied 
and the number of neighbors k. This k defines the 
neighborhood in which training data is consulted. The 
test data is compared with the training data. The 
nearest K- instances are checked and the majority 
nearest to k dictates the class that the instance 
belongs. In this research, k is set to 5. For the 
Classification Using K-Means Algorithm, Given a 
training set of human and vehicle classes, a K-Means 
clustering algorithm is trained using the vector X = 
(xi, x 2 ,...,x L ) for human and Y = (yi,y 2 , ..., y L ) for 
vehicle data, where L is the total number of training 
images. To train the K-means, the number of classes 
and the mean of each class is provided. K-Means now 
clusters each data item to any of the classes based on 
the Euclidean distance of the data to the mean of each 
cluster. The minimum distance from each class 
determines which cluster the data falls. Now after the 
entire instance are clustered, the means are re- 
evaluated again and the process continues until a 
given terminating condition is fulfilled. To cluster a 
new data, the distances of the new data to each of the 
evaluated cluster means are calculated and the one 
with minimum is the class of the new data. Object 
classification using ANN uses input nodes, which are 
32 in number, the hidden layer which has 32 neurons 
and the output layer which has one neuron. The 
following steps are adopted in the Neural Network 
modeling for object classification. In this approach, 
shape features extracted from the detected moving 
objects are used to train a neural network classifier in 
order to recognize human and vehicle classes. The 
Learning Algorithm used is the back propagation 
algorithm. The dimension of input data is thirty two. 
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The network has one hidden layer having thirty two 
neurons and one output neuron. The network is 
trained using the back-propagation. This is a 
supervised learning algorithm in which the input 
vectors are supplied together with the desired output. 
The back propagation algorithm (BPN) learns during 
the training epoch. Several epochs are needed before 
the network can sufficiently learn and provide a 
satisfactory result. The number of epochs used is 
500, with the momentum of 0.5 and learning rate of 
0.3. The initial weights are randomly initialized to 
small random numbers less than 1 using random 
number generators. The object classification using 
using neuro-genetic model (GA-ANN) applies genetic 
algorithm to optimize the weights before the ANN is 
trained with the optimized weight. In this case, there 
are (32*32+32) number of weights which signifies a 
chromosome. The population size N is 50, crossing 
probability P c is 0.8, mutation probability P m is 0.015 
and network’s terminative condition is MAXGEN of 
100. This combination gives an excellent empirical 
performance. 

6 Results and discussion 

Table 1 shows the analysis result for GA-SVM, 
SVM, GA-ANN, K-MEANS, K-NN. GA-SVM. For 
the GA-SVM, 80 data are used to train the model.. 
The support vectors’ values from the training are then 
saved. The gamma parameter obtained is 19.28 and 
the C value is 6.49. For testing, 333 data items 
consisting of 1 10 vehicle data and 223 pedestrian data 
are used. Out of the 110 vehicles, all are classified 
correctly while for 223 human class data items, only 2 
are misclassified. Error in classification is 0.61% and 
the accuracy is 99.39%. For the normal SVM, error in 
classification is 0.9% and the accuracy is 99.1%. For 
GA-ANN, error in classification is 0.9% and the 
accuracy is 99.1%. For K-MEANS, error in 
classification is 1.2% and the accuracy is 98.8% and 
for ANN, error in classification is 1.5% and accuracy 
is 98.5% and for K-NN, error in classification is 4.8% 
and accuracy is 95.2%. Figure 6 shows the 
classification accuracy. From the figure, it is showned 
that the proposed model has the highest classification 
accuracy. 
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Figure 6: Classification Accuracy of the classifiers 
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7 Conclusion 

In this paper, SVM with GA is applied to object 
classification in videos. In the proposed model, an 
optimized RBF kernel function and parameter C of 
SVM is obtained. Video data of moving objects have 
been used to evaluate the model. The experimental 
results show that the proposed classifier performs 
better than the SVM in which parameters are chosen 
randomly. The comparative analysis of the model 
with that of SVM, GA-ANN, K-MEANS, K- 
NN.ANN shows that the model shows more excellent 
performance in terms of classification accuracy. In 
future other optimization techniques will be looked 
into in order to investigate the performance of the 
SVM model on other optimization techniques. 

References 

[1] Akintola K.G, Tavakkolie A. (2011). Robust 
Foreground Detection in Videos using 
Adaptive Colour Histogram Thresholding and 
Shadow Removal. Proceedings of the 7 th 
International Symposium on Visual 
Computing, Vol. part ii JSVC’l 1, pp. 496-505, 
Berlin, Heideiberg, Germany, September, 
2011 . 

[2] Akintola K.G., Olabode O. (2014) A Neuro- 
Genetic Model For Moving Object 
Classification In Video Surveillance Systems. 
International Journal of research in Computer 
Applications and Robotics Vol. 2, No8 pp. 
143-151, July, 2014. 

[3] Chang C.C and Lin, C. J. (2001) LIB SVM: A 
Library for Support Vector Machines. 
Software available at 
http://www.csie.ntu.edu.tw/~cjlin/libsvm . 

[4] Chiu P., Girgensohn A., Polak W., Rieffel, E, 
Wilcox L. (2000). A Genetic Algorithm for 
video Segmentation and Summarization. 
Proceedings of IEE International Conference 
on Multimedia and Expo, Vol. 3, pp. 1329- 
1332, New York, 30 th July-2 nd August, 2000. 

IJTSRD | May-Jun 2017 
Available Online @www.ijtsrd.com 


[5] Collins, R. T., Lipton, A. J., Fujiyoshi, H., and 
Kanade, T. (2000). A System for Video 
Surveillance and Monitoring: VSAM Final 
Technical Report, CMU-RI-TR-00-1 2, 
Robotics Institute, Carnegie Mellon 
University, May 2000. 

[6] Elgammal A., Duraiswami R, Harwood D.,and 
Davis L.S. (2002). Background and 
Foreground Modeling Using Nonparametric 
Kernel Density Estimation for Visual 
Surveillance. Proceedings of the IEEE Vol.90, 
No.7, pp. 1151-1163, JULY 2002. 

[7] Heikkila J. and Silven O. (1999) A real-time 
system for monitoring of cyclists and 
pedestrians. Prococeedings of Second IEEE 
Workshop on Visual Surveillance, pp. 74-81, 
Fort Collins, Colorado, June 1999. 

[8] Hochreiter S., (2009). Theoretical 

Bioinformatics and Machine Learning Lecture 
Notes. Institute of Bioinformatics, Johannes 
Kepler University A-4040 Linz, Austria. 

[9] Modi R. V. and Mehta T. B. (2011). ‘Neural 
Network-Based Approach for Recognition of 
Human Motion Using Stationary Camera’. 
International Journal of Computer 
Applications, Vol. 25, No. 6, pages 975-887. 

[10] Montana D.J., and Davis L.D. (1989). 
Training Feed-Forward Networks using 
Genetic Algorithms. Proceedings of the 
International Joint Conference on Artificial 
Intelligence, IJCAI-89. Morgan Kaufman 
Publishers, Deutrot, MI, pp. 762-767. 

[11] Michael P. and El-Kaliouby R. (2003) Real 
time facial expression recognition in video 
using Support Vector Machine in proceedings 
of ICM Nov 5-7, 2003 Vancour, British 
Columbia, Canada. 

[12] Ramya G and Srclatha M. (2014) Multiple 
Object Tracking using Support Vector 
Machine. Global Journal of Researches in 
Engineering Subtraction Vol. 14 Issue 6 
Version 1. 

162 


International Journal of Trend in Scientific Research and Development, Volume 1(4), ISSN: 2456-6470 

www.ijtsrd.com 


[13] Ravindra M Chandraslekhar G. (2014) 
Classification of Satellite Images based on 
SVM 

Classifier using Genetic Algorithm. 

International Journal of Innovative Research 

in Electrical Electronics Instrumentation and 

Control Engineering Vol. 2 Issue 5, May 2014 

[14] Rilley J., Ciesielski V.B. (1998). An 

Approach to Training Feed-Forward and 
Recurrent Neural Networks. Proceedings of 
the Second International Conference on 
Knowledge-Based Intelligent Electronic 
Systems (KES’98), pp. 596-602, Adelaide, 
April 1998, USA. 

[15] VishwaKarma D.D. (2012). Genetic 
Algorithm Based Weights Optimization of 
Artificial Neural Network, International 
Journal of Advanced Research in Electrical, 
Electronics and Instrumentation Vol. 1. No 
3, August 2012. 

[16] Vapnik V,N.(1995). The Nature of Statistical 
Learning Theory. Springer Verlag, New York, 

1995. ISBN, 0-387-94559-8. 

[17] Wainwright R.L. (1993). Introduction to 

Genetic Algorithms Theory and Applications. 
The seventh Oklahoma Symposium on 
Artificial Intelligence. November, 19,1993. 
The 

University of Tilsa Cao Smith College 
Avenue, Tulsa Oklaoma. 


163 

IJTSRD | May-Jun 2017 
Available Online @www.ijtsrd.com 


